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Full Record 



Details for HUGENEFL:M64347_AT 
Fullscreen 

Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

Probe Set ID M64347_at 

GeneChip 
Array 

Organism 
Common 
Name 



-> START } 



NetAffx Links 



HumanGeneFL Array 
Human 

Probe Design Information 



Transcript ID M64347 
Se Type° e Exemplar sequence 

Represen^ive M64347 NCB| 
Public ID 

Target M64347, class A, 20 probes, 20 in M64347 3336-3720, Human novel growth facto 
Description receptor mRNA, 3' cds 



Assembly 
Alignment(s) 



Overlapping 
Transcripts 



Genomic Alignment of Target Sequence 
April 2003 (NCBI 33) 

Position % Identity Cytoband 

chr4: 1771773-1772182 (+) UCSC 93 . p16.3 



UniGene Description 



Representative 
Transcript 

My nnni/10 fibroblast growth factor receptor 3 

r,^Q, (achondroplasia, thanatophoric 

^ dwarfism) 

mm nooofi* fibroblast growth factor receptor 3 

m^d. (achondroplasia, thanatophoric 
dwarfism) 



Position 

chr4:1 757261- . 
1772237 (+) UCSC 

chr4: 1757261- 
1772237 (+) UCSC 



Public Domain and Genome References 
Gene Title fibroblast growth factor receptor 3 (achondroplasia/thanatophoric dwarfism) 
Gene Symbol FGFR3 HGNC 

Chromosomal . 
Location 4p16 ' 3 
UniGene ID Hs.1420 NCBI (FULL LENGTH) 
Ensembl ENSG00000068078 Ensembl 
LocusLink 2261 NCBI 



P22607 EMBL-EBI 
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Swiss Prot 

EC 
OMIM 



Q96T34 EMBL-EBI 
Q96T35 EMBL-EBI 
Q96T36 EMBL-EBI 
Q9NRB6 EMBL-EBI 

2.7.1.112 

134934 NCBI 



RefSeq Protein NP_000133 NCBI 
ID NP 075254 NCBI 



RefSeq 



Ortholog 



RefSeq Transcript ID RefSeq Title 

NM_000142 NCBI fibroblast growth factor receptor 3 isoform 1 precursor 

NM_022965 NCBI fibroblast growth factor receptor 3 isoform 2 precursor 

Functional Annotations 

ID Title 

DROSGENOME1:143549 AT breathless 



fibroblast growth factor Rat 
receptor 3 

fibroblast growth factor Rat 
receptor 3 

fibroblast growth factor Rat 
receptor 3 

RG-U34B:RC AA899336 AT fibroblast growth factor Rat 



RAE230A: 1369373 AT 
RAE230B:1384056 AT 
RAE230B:1384829 AT 



RG-U34C:RC AM 36304 AT 
RG-U34C:RC Al 145424 AT 
MG-U74AV2:160919 R AT 
MG-U74AV2:162253 I AT 
MOE430A:1421841 AT 
MQE430A: 1425796 A AT 



receptor 3 

fibroblast growth factor Rat 
receptor 3 

fibroblast growth factor Rat 
receptor 3 

fibroblast growth factor Mouse 
receptor 3 

fibroblast growth factor Mouse 
receptor 3 

fibroblast growth factor Mouse 
receptor 3 

fibroblast growth factor Mouse . 



receptor 3 

MU11KSUBA:M81342 S AT fibroblast growth factor Mouse 

receptor 3 

MOUSE430 2:1421841 AT fibroblast growth factor Mouse 

receptor 3 

MOUSE43Q 2:1425796 A AT fibroblast growth factor Mouse 

receptor 3 

MOUSE430A 2:1421841 AT fibroblast growth factor Mouse 

receptor 3 

MOUSE430A 2:1425796 A AT fibroblast growth factor Mouse 

receptor 3 



Organism Type 

Drosophila Putative 
Ortholog 

Putative 
Ortholog 

Putative 
Ortholog 

Putative 
Ortholog 

Putative 
Ortholog 

Putative 
Ortholog 

Putative 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 



GO Biological Process (view graph) 

ID Description 
165 MAPKKK cascade 

1501 skeletal development 

7048 oncogenesis 

7259 JAK-STAT cascade 



Evidence 

experimental 
evidence 

predicted/computed 

experimental 
evidence 

experimental 



Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

QuickGO 
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8543 FGF receptor signaling pathway 

GO Cellular Component (view graph) 
ID Description 
Gene Ontology 5887 integral to plasma membrane 

GO Molecular Function (view graph) 

ID Description 

5007 fibroblast growth factor receptor 
activity 



evidence 

experimental 
evidence 



Evidence 

experimental 
evidence 



Evidence 

experimental 
evidence 



AmiGO 

QuickGO 
AmiGO 

Links 

QuickGO 
AmiGO 

Links 

QuickGO 
AmiGO 



Method ID 
blast 13112048 



blast 



13186255 



0.0 



0.0 



Protein 
Similarities 



blast 



blast 



4503711 



Description E-Value 

fibroblast growth factor receptor i3 
isoform 2 precursor; hydroxyaryl- 
protein kinase; tyrosine kinase JTK4 
[Homo sapiens] 

fibroblast growth factor receptor 2 
isoform 3 precursor; keratinocyte 
growth factor receptor; K-sam 
protein; protein tyrosine kinase, 
receptor like 14; FGF receptor; 
bacteria-expressed kinase; fibroblast 
growth factor receptor BEK; 
tyrosylprotein kinase; hydroxyaryl- 
protein kinase [Homo sapiens] 
fibroblast growth factor receptor 3 0.0 
isoform 1 precursor; hydroxyaryl- 
protein kinase; tyrosine kinase JTK4 . 
[Homo sapiens] 



20452380 



0.0 



Method ID 

Hanks FGFR-3 



Protein 
Families 



ec 



Hanks FGFR-3 



ec 



Description 

FGR3_HUMAN (FGFR-3) KINASES:5.6.3 | 
PTK Group B membrane spanning protein 
tyrosine kinases.PTK XV Fibroblast growth 
factor receptor family .FGFR-3 

ZA70 HUMAN ZA70_HUMAN EC:2.7.1 .1 12:TYROSlNE- 

PROTEIN KINASE ZAP-70 (EC 2.7.1.1 12) (70 
KDA 2ETA-ASSOCIATED PROTEIN) (SYK- 
RELATED TYROSINE KINASE). 

FGR3_HUMAN (FGFR-3) KINASES:5.6.3 | 
PTK Group B membrane spanning protein 
tyrosine kinases.PTK XV Fibroblast growth 
factor receptor family .FGFR-3 

ZA70 HUMAN ZA70JHUMAN EC:2.7.1 .1 12:TYROSINE- 

PROTEIN KINASE ZAP-70 (EC 2.7.1.112) (70 
KDA ZETA-ASSOCIATED PROTEIN) (SYK- 
RELATED TYROSINE KINASE). 



E-Value 

1.0E- . 
166 



7.38E- 
99 



1.0E- 
167 



7.38E- 
99 



Database 


lb 


Description 


E-Value 


scop 


dlgjda 


d1gjoa_SCOP:d. 144.1. 2:| Fibroblast growth 


3.81 E- 




factor receptor 2 


81 


scop 


d1ev2e1 


d1ev2e1 SCOP:b.1.1.4:| Fibroblast growth 


4.95E- 




factor receptor, FGFR 


21 


scop 


dlgjoa 


d1gjoa_SCOP:d.144.1.2:| Fibroblast growth 


3.81 E- 




factor receptor 2 


81 
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Protein 
Domains 



scop 


d1ev2e1 


d1ev2e1 SC0P:b.1.1.4:| Fibroblast growth 


4.25E- 




factor receptor, FGFR 




pfam 




Immunoglobulin domain 


1 .Dt-0 


pfam 


ig 


Immunoglobulin domain 


3.2E-8 


pfam 


pkinase 


Protein kinase domain 




pfam 


is 


Immunoglobulin. domain 


1.6E-5 


pfam 


ia 


Immunoglobulin domain 


3.2E-8 


pfam 


pkinase 


Protein kinase domain 


2.3E-92 


pfam 




Immunoglobulin domain 


7.3E-8 


IntprPrn 


IPR000719 

II 1 WUU / 1 fc7 

EMBL-EBI 


Protein kinase 




IntprPrn 

II HOI 1 1 \J 


IPR007110 

1 1 1 \wv 1 1 1 \J 

EMBL-EBI 


Immunoalobulin-like 

II III 1 IUI is_/y IW wum l jh *v 




interrro 


IDDHH1 OAR 

EMBL-EBI 


i yrubine jjruioin Kinase 




InterPro 


IPR008266 
EMBL-EBI 


Tyrosine protein kinase, active site 




InferPro 


IPR003598 
EMBL-EBI 


Immunoglobulin C-2 type 





Target 
Sequence 



Trans Membrane 

ID 

NP 000133 



Number Of 
Domains 



Probability of Interior N-Terminus 

0.11005 



Sequence 

>HUGENE FL : M6 4 3 4 7_AT 

gacttcaaagcaagctggtattttcatacaaattcttctaafctgctgtgtgtcccaggca 
gggagacggtttccagggaggggccggccctgtgtgcaggttccgatgttattagatgtt 
acaagtttatatatatctatatatataatttattgagttt'ttacaagatgtatttgttgt 
agacttaacacttcttacgcaatgcttctagagttttatagcctggactgctacctttca 
aagcttggagggaagccgtgaattcagttggttcgttctgtactgttactgggccctgag 
tctgggcagctgtcccttgcttgcctgcagggccatggctcagggtggtctcttcttggg. 
gcccagtgcatggtggccagaggtgtcacccaaaccggcaggtgcgatt 



Probe Info 



Probe Sequence(5'-3') 

GACTTCAAAGCAAGCTGGTATTTTC 
CATACAAATTCTTCTAATTGCTGTG 
AATTCTTCTAATTGCTGTGTGTCCC 
TGCTGTGTGTCCCAGGCAGGGAGAC 
TGTGTGCAGGTTCCGATGTTATTAG 
TCTTACGCAATGCTTCTAGAGTTTT 
GCAATGCTTCTAGAGTTTTATAGCC 
GAGTTTTATAGCCTGGACTGCTACC 
TGCTACCTTTCAAAGCTTGGAGGGA 
AAGCTTGGAGGGAAGCCGTGAATTC 
TGAATTCAGTTGGTTCGTTCTGTAC 
GTTCGTTCTGTACTGTTACTGGGCC 
CTGGGCCCTGAGTCTGGGCAGCTGT 
CCTGAGTCTGGGCAGCTGTCCCTTG 
TCTGGGCAGCTGTCCCTTGCTTGCC 
TCCCTTGCTTGCCTGCAGGGCCATG 



Probe Probe 


Probe 

Interrogation strandedness 


A 


T 


Position 




359 


161 


3348 


Antisense 


360 


161 


3372 


Antisense 


361 


161 


3378 


Antisense 


362 


161 


3390 


Antisense 


363 


161 


3438 


Antisense 


364 


161 


3540 


Antisense 


365 


161 


3546 


Antisense 


366 


161 


3558 


Antisense 


367 


161 


3576 


Antisense 


368 


161 


3588 


Antisense 


369 


161 


3606 


Antisense 


370 


161 


3618 


Antisense 


371 


161 


3636 


Antisense 


372 


161 


3642 


. Antisense 


373 


161 


3648 


Antisense 


374 


161 


3660 


Antisense 
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GCTTGCCTGCAGGGCCATGGCTCAG 375 161 3666 Antisense 

CTTGGGGCCCAGTGCATGGTGGCCA 376 161 3702 Antisense 

GTGGCCAGAGGTGTCACCCAAACCG 377 161 3720 Antisense 

GTCACCCAAACCGGCAGGTGCGATT 378 161 3732 Antisense 
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PubMed 



Sequence Revision History 

Entrez BLAST OMIM 




Taxonomy 



SITE 
MAP 

Structure 



NCBI Home 

Site Map 

brief and complete 

versions; 

About NCBI 
general and contact 
information 

GenBank 
submit your 
sequence, general 
information 

Molecular 
Databases 
nucleotides, 
proteins, structures 
and taxonomy 

Literature 
Databases 
PubMed, PubRef, 
OMIM, Citation 
Matcher 

Genomes and 
Maps 

maps, the human 
genome and mode! 
organisms 

Tools 

for data mining and 
analysis 

Research at 
NCBI 

people and projects 

Software 
Engineering 
.Tools, R&D and 
databases 

Education 

teaching resources 
and on-line tutorials 



The Sequence Revision History tool allows you to see the 
various gi numbers, version numbers, and update dates for 
sequences that appeared in a specific GenBank record. 

E.g., search for U46667 in the tool to see the old and current 
identifiers of the nucleotide sequence in that record. 

Note that the original gi number for the nucleotide sequence, 
2734632, does not have a corresponding version number. This 
is true because it was removed from the database (and replaced 
by 3172140) before the new accession. verion system was 
implemented in Feb. 1999. At that time, each sequence in the 
GenBank/EMBL/DDBJ database received a version number of 1 , 
even if they had been updated in the past. 

In addition, if a GenBank record contains an updated sequence, 
the Comment field will contain a cross-reference to the gi 
number of the earlier sequence. (E.g., see U46667 in Entrez.) If 
you follow the link for that earlier gi number, Entrez will display 
that version of the GenBank record. Similarly, the Comment field 
of the older version will have a warning that the sequence has 
been updated, and will contain a cross-reference to the newer 
version. 



More details about sequence identification numbers (GI and 
accession.version). 

Back to sample record . 



FTP site 
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download data and 
software 

Help Desk NCBI NLM NIH Credits 

Revised October l f 2003 

Questions about NCBI resources to info@ncbi.nlm.nih.gov 
Comments about site map to Renata Geer renata@ncbi.nlm.nih. gov 



Vitttv//www nr.hi.n1iTi.riih.gov/Siternao/seauencerevisionhistorv.htnil 
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PubMed Nucleotide Protein Geno me S tructure 

Find (Accessions, GI numbers or Fasta style Seq/ds)jU46667 

About Entrez 

- 



Sequence Revision History 

PMC Taxonomy 



OMIM 



Entrez 



Search for Genes 

LocusLink provides curated 
information for human, fruit 
fly, mouse, rat, and 
zebrafish 

Help|FAQ 

Batch Entrez: Upload a 
file of GI or accession 
numbers to retrieve 
protein or nucleotide 
sequences 

Check sequence 
revision history 

How to create WWW 
links to Entrez 

LinkOut 

Cubby 



|§§jf| difference between I and II as jGenBank/GenPept 
Revision history for U46667 



GI 


Version 


Update Date 


Status 


I 


II 


j 3172140 


1 


Alia 7 1998 9:28 AM 


Live 


(•■ 


G 


3172140 


1 


Jun 2 19984:31 PM : 


Dead 


f) 


& 


2734632 


n/a 


Jan 3 1998 12:12 AM 


Dead 


o 




2734632 


n/a 


Jan 1 1998 12:30 AM 


Dead 


0 





Accession U46667 was first seen at NCBI on Jan 1 1998 12:30 AM 



Related resources 

BLAST 

Reference sequence project 
LocusLink 

Clusters of ortnologous groups 
Protein reviews on the web 



Disclaimer | Write to the Help Desk 
NCBI I NLM | NIH 



V»ttn-/Av\AAvnr.hi n1m.nih.frov/entre7/sutils/eirevhistxei?val-U46667 
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AFFYMETRIX, 



The new GeneChip* One-Cycle and 
Two-Cycle cDNA Synthesis Kits. 




PRODUCTS _ 

" GETTING 
STARTED 

-> Wizard 



QUERY 

Expression 

~> Quick Query 
-> Standard Query 
-> Batch Query 
-> BLAST 
-> Probe Match 
-> UCSC Query 
Genotyping 
-> Quick Query 
-> Standard Query 
-> Batch Query 
■-> UCSC Query 
-> SNP Finder 

:: CURRENT QUERY 
1 probe sets 

-> Annotations 
-> Show Orthologs 
-> GO Browser 
-> Export 

:: QUERY HISTORY 

Annotation Views 

-> Expression 
~> Genotyping 

-> BLAST Status 

-> New Folder 
-> Expression 
Queries 

->(1)AII Descriptions 

(L17131) 
-> (O)Aii Descriptions 

(L17131jrnaMat) 
-> (O)AII Descriptions 

(L17131_rnaLat) 
->(1)A!I Descriptions 

(m64347) 
-> all probe sets 

(7129) 
-> Genotyping 
Queries 
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Full Record 



Details for HUGENEFL:L17131_RNA1_AT 

Full Screen 

Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

L17131 rna1 at 



-> START i 



NetAffx Links 



HumanGeneFL Array 



Human 



Probe Design Information 



L17131jna1 
Exemplar sequence 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
Name 



Transcript ID 

Sequence 
Type 

Target L1 71 31, class A, 20 probes, 20 in L17131mRNA#1 1646-2198, Human high 

Description mobility group protein (HMG-i(Y)) gene exons 1-8, complete cds 

Sequence 

>HUGENEFL : Ll 713 1_RNA1_AT 

ttgtccaggtgaggcccaagagccctgtggccgccacctgaggtgggctggggctgctcc 
cctaaccctactttcgttccgccactcagccatttccccctcctcagatggggcaccaat 
.aacaaggagctcaccctgcccgctcccaacccccctcctgctcctccctgcccccc'aagg 
ttctggttccatttttcctctgttcacaaactacctctggacagttgtgttgttttttgt 
Target tcaatgttccattcttcgacatccgtcattgctgctgctaccagcgccaaatgttcatCG 
Sequence tcattgcctcctgttctgcccacgatcccctcccccaagatactGtttgtggggaagagg 
ggctggggcatggcaggctgggtgaccgactaccccagtcccagggaaggtggggccctg 
cccctaggatgctgcagcagagtgagcaagggggcccgaatcgaccataaagggtgtagg. 
ggccacctcctccccctgttctgttggggaggggtagccatgatttgtcccagcctgggg 
ctccctctctggtttcctatttgcagttacttgaata 



Probe Sequence(5'-3'; 



Probe Probe 
X Y 



TTGTCCAGGTGAGGCCCAAGAGCCC 294 101 

AGGTGAGGCCCAAGAGCCCTGTGGC 295 101 

ACCAATAACAAGGAGCTCACCCTGC 296 101 

TTTTCCTCTGTTCACAAACTACCTC 297 101 

CTACCTCTGGACAGTTGTGTTGTTT 298 101 

TTCCATTCTTCGACATCCGTCATTG 299 101 

TCTTCGACATCCGTCATTGCTGCTG 300 101 



Probe 
Interrogation 

Position 

1658 
1664 

1772 ; 

1850 
1868 
1904 
1910 



Strandednes; 

Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
Antisense 
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GCTACCAGCGCCAAATGTTCATCCT 


301 


A f\A 

101 


1934 


Antisense 


TCATCCTCATTGCCTCCTGTTCTGC 


302 


A f\A 

101 


a no 
1952 


Antisense 


TCATTGCCTCCTGTTCTGCCCACGA 


303 


101 


a nco 

1958 


Antisense 


AAGATACTCTTTGTGGGGAAGAGGG 


304 


101 


a nh a 

1994 


Antisense 


GCAGGCTGGGTGACCGACTACCCCA 


305 


101 


2030 


Antisense 


CCCCTAGGATGCTGCAGCAGAGTGA 


306 


101 


2078 


Antisense 


Probe Info AGCAAGGGGGCCCGAATCGACCATA 


307 


A f\ A 

101 


2102 


Antisense 


CGAATCGACCATAAAGGGTGTAGGG 


308 


101 


2114 


Antisense 


GCCATGATTTGTCCCAGCCTGGGGC 


309 


101 


2174 


Antisense 


CTGGGGCTCCCTCTCTGGTTTCCTA 


310 


101 


2192 


Antisense 


CTCCCTCTCTGGTTTCCTATTTGCA 


311 


101 


2198 


Antisense 


CTCTGGTTTCCTATTTGCAGTTACT 


312 


101 


2204 


Antisense 


TTTCCTATTTGCAGTTACTTGAATA 


313 


101 


2210 


Antisense 
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AFFYMETRIX, 

Ay:--: 



products BEEE 



:: GETTING 
STARTED 

-> Wizard 

« QUERY 

Expression 

-> Quick Query 
-> Standard Query 
-> Batch Query 
. -> BLAST 
-> Probe Match 
-> UCSC Query 
Genotyping 
-> Quick Query 
-> Standard Query 
-> Batch Query 
-> UCSC Query 
->SNP Finder 

:: CURRENT QUERY 
1 probe sets 

-> Annotations 
-> Show Orthologs 
-> GO Browser 
-> Export 

:: QUERY HISTORY 

Annotation Views 

-> Expression 
-> Genotyping 

-> BLAST Status 

-> New Folder 
-> Expression 
Queries 

->(1)AH Descriptions 

(X74801) 
->(1)AII Descriptions 

(L17131) 
->(0)AII Descriptions 

(L17131jrnal_at) 
-> (O)AII Descriptions 
... (L17131jnaLat) 
->(1)AII Descriptions 

(m64347) 
-> Genotyping 
Queries 
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Full Record 



Details for HUGENEFL:X74801_AT 
Full Screen 

Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

X74801 at 



-> START ^ 



NetAffx Links 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
Name 



HumanGeneFL Array 



Human 



Probe Design Information 

Transcript ID X74801 
^Type^ Exemplar sequence 

-pSSid 1 ^ X748Q1 ncbi 



Target 
Description 



Assembly 
Alignment(s) 



Overlapping 
Transcripts 



X74801, class B, 20 probes, 12 in X74801cds 1282-1552: 8 in reverseSequence, 
1636-1837, H.sapiens Cctg mRNA for chaperonin 

Genomic Alignment of Target Sequence 

April 2003 (NCBI 33) 

Position % Identity Cytoband 



q22 



Gene Title 

Gene Symbol CCT3 HGNC 

Chromosomal 
Location 

UniGene ID 

Ensembt 

LocusLink 



chrl : 153495555-153497649 (-) UCSC 100 

Representative UniGene Descriptjon 

Transcript 

NM_005998 chaperonin containing TCP1 , 

NCBI subunit 3 (gamma) 

Public Domain and Genome References 
chaperonin containing TCP1, subunit 3 (gamma) 



Position 



chrl : 1 53495551 -1 53524840 
(-) UCSC 



SwissProt 
OMIM 



1q23 

Hs.1 708 NCBI (FULL LENGTH) 
ENSG000001634B8 Ensembl 

7203 ncbi 
aah06501 embuebi 

P49368 EMBL-EBI 
600114 NCBI 



RefSeq^Protein Np ^ 00 5989 NCBI 
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RefSeq 



RefSeq Transcript ID RefSeq Title 

NM_005998 NCBI chaperonin containing TCP1 . subunit 3 (gamma) 

Functional Annotations 

ID 



Ortholog 



ATH1-1 21 501:246830 AT . 
ATGENOME1: 18906 AT : 
DROSGENOME1:153982 AT 
MG-U74AV2:161238 F AT 
MG-U74AV2:98153 AT 
MG-U74CV2:171548- AT 
MOE430A:1416024 X AT 
MQE430A:1 426067 X AT 
MOE430A:1448178 A AT 
MOE430A:1449645 S AT 
MOE430A:1451915 AT 
MOE430A: 1459987 S AT 



Title Organism Type 

chaperonin, putative Arabidopsis Putative 

Ortholog 

chaperonin, putative Arabidopsis Putative 

Ortholog 

Drosophila Putative 
Ortholog 

chaperonin subunit 3 Mouse 
(gamma) 

chaperonin subunit 3 Mouse 
(gamma) 

chaperonin subunit 3 Mouse 
(gamma) 

chaperonin subunit 3 Mouse 
(gamma) 

chaperonin subunit 3 Mouse 
(gamma) 

chaperonin subunit 3 Mouse 
(gamma) 

chaperonin subunit 3 Mouse 
(gamma) 

chaperonin subunit 3 Mouse 
(gamma) 

chaperonin subunit 3 Mouse 



(gamma) 

MU11KSUBA:C79428 RC F AT chaperonin subunit 3 Mouse 

(gamma) 

MU11KSUBA:L20509 F AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE430 2:1416024 X AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE43Q 2:1426067 X AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE430 2:1448178 A AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE430 2:1449645 S AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE430 2:1451915 AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE43Q 2:1459987 S AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE430A 2:1416024 X AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE430A 2:1426067 X AT chaperonin subunit 3 Mouse 

(gamma) — 

MOUSE430A 2:1448178 A AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE430A 2:1449645 S AT chaperonin subunit 3 Mouse 

(gamma) 

MOUSE430A 2:1451915 AT chaperonin subunit 3 Mouse 

(gamma) 



Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 
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MOUSE430A 2:1459987 S AT chaperonin subunit 3 Mouse Curated 

(gamma) Ortholog 



GO Biological Process (view graph) 

ID Description Evidence 
6457 protein folding traceable author statement 
GO Cellular Component (view graph) 
ID Description Evidence 
Gene Ontology 5829 cytosol not recorded 

5856 cytoskeleton . traceable author statement 
GO Molecular Function (view graph) 

ID Description Evidence 
3754 chaperone activity traceable author statement 



Links 
QuickGO AmiGO 

Links 
QuickGO AmiGO 
QuickGO AmiGO 

Links 

QuickGO AmiGO 



Protein 
Similarities 



Protein 
Domains 



5524 ATP binding 



inferred from electronic annotation QuickGO AmiGO 



Method 


ID 


Description 


E-Value 


Diast 


33873532 




0.0 


blast 


31542292 


chaperonin containing TCP1, subunit 3 (gamma); 
TCP1 (t-complex-1 j ring complex, polypeptide 5 
[Homo sapiens] 


0.0 


Database 


ID 


Description 


E-Value 


scop 


d1a6da3 


d1a6da3 SCOP:d.56.1.2:| Thermosome 


4.08E- 
25 


scop 


dlqmla 


d1gmla_ SCOP:c.8.5.2:| Thermosome 


1.01E- 
57 


scop 


d1a6da1 


d1a6da1 SCOP:a.129.1.2:| Thermosome 


4.81 E- 
83 


pfam 


cpn60 TCP1 TCP-1/cpn60 chaperonin family 


5.7E- 
210 


InterPro 


IPR002423 
EMBL-EBI 


Chaperonin Cpn60rTCP-1 




InterPro 


IPR001844 
EMBL-EBI 


Chaperonin Cpn60 




InterPro 


IPR002194 
EMBL-EBI 


Chaperonin TCP-1 




InterPro 


IPR008950 
EMBL-EBI 


GroEL-like chaperone, ATPase 





Target 
Sequence 



Sequence 

>HUGENEFL : X7 4 8 0 1_AT 

•atgactggtgtggaacaatggccatacagggctgttgcccaggccctagaggtcattcct 
cgtaccctgatccagaactgtggggccagcaccatccgtctacttacctcccttcgggcc 
aagcacacccaggagaactgtgagacctggggtgtaaatggtgagacgggtactttggtg 
gacatgaaggaactgggcatatgggagccattggctgtgaagctgcagacttataagaca 
gcagtggagacggcagttctgctactgcgaattgatgacatcgtttcaggccacaaaaag 
aaaggcgatgaccagagccggcaaggcggggctcctgatgctggccaggagtgagtgcta 
ggcaaggctacttcaatgcacagaaccagcagagtctccccttttcctgagccagagtgc 
caggaacactgtggacgtctttgttcagaagggatcaggttggggggcagcccGsagtcc 
ctttctgtcccagctcagttttccaaaagacactgacatgtaattcttctctattgtaag 
gtttccatttagtttgcttccgatgattaaatctaagtca : 



Probe Sequence(5'-3') 



Probe Probe 
X Y 



Probe 

Interrogation strandedness 
Position 
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Probe Info 



ATGACTGGTGTGGAACAATGGCCAT 


bU 


o4D 


1 9Q/4 


Anticonco 
MllUbcl Jot? 


GAACAATGGCCATACAGGGCTGTTG 


0 1 


o45 


lOUO 


AFiusense 


CTGATCCAGAACTGTGGGGCCAGCA 


62 


O A C 


loOU 


Aniisense 


CAGAACTGTGGGGCCAGCACCATCC 


63 


345 


loob 


Antisense 


TGTGGGGCCAGCACCATCCGTCTAC 


64 


345 


*t OTTO 

1372 


Antisense 


CTGGGCATATGGGAGCCATTGGCTG 


65 


345 


14ob 


Antisense 


ATATGGGAGCCATTGGCTGTGAAGC 


66 


345 


1492 


Antisense 


GAGCCATTGGCTGTGAAGCTGCAGA . 


67 


345 


■4 <4 no 

1498 


Antisense 


TTGGCTGTGAAGCTGCAGACTTATA 


68 


345 


IOU4 


Antisense 


GAGACGGCAGTTCTGCTACTGCGAA 


69 


345 


104U 


Antisense 


GCAGTTCTGCTACTGCGAATTGATG 


70 


1 A C 

345 


104b 


Aniisense 


ATTGATGACATCGTTTCAGGCCACA 


71 


345 


1564 


Antisense 


GTGCTAGGCAAGGCTACTTCAATGC 


72 


O A C 

345 


1648 


Antisense 


GGCAAGGCTACTTCAATGCACAGAA 


73 


345 


1654 


Antisense 


GCTACTTCAATGCACAGAACCAGCA 


74 


345 


1660 


Antisense 


CACAGAACCAGCAGAGTCTCCCCTT 


75 


345 


1672 


Antisense 


GAGCCAGAGTGCCAGGAACACTGTG 


76 


345 


1702 


Antisense 


CACTGACATGTAATTCTTCTCTATT 


77 


345 


. .1804 


Antisense 


TAGTTTGCTTCCGATGATTAAATCT, 


78 


345 


1843 


Antisense 


GCTTCCGATGATTAAATCTAAGTCA 


79 


345 


1849 


Antisense 
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Full Record 



Details for HUGENEFL:U15008 AT 
Full Screen 

Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

Probe Set ID U15008_at. 
GeneChip 

Array 
Organism 



-> START 1 



NetAffx Links 



Common 
Name 



HumanGeneFL Array 
Human 

Probe Design Information 



Transcript ID U 15008 

Sequence 
Type 



Exemplar sequence 



Representative CB| 
Public ID 

Target U 15008, class A, 20 probes, 20 in U 15008 25-433, Human S'nRNP core protein 
Description Sm D2 mRNA, complete cds 



Assembly 
Alignment(s) 



Overlapping 
Transcripts 



Gene Title 
Gene Symbol 
Chromosomal 
Location 

UniGene ID 
Ensembl 
LocusLink 
SwissProt 
OMIM 



Genomic Alignment of Target Sequence 
April 2003 (NCBI 33) 

Position % Identity Cytoband 

chr19: 50882580-50883664 (-) UCSC 98 q13.32 



Representative 
Transcript 

NM_004597 
NCBI 



UniGene Description 

small nuclear ribonucleoprotein D2 
polypeptide 16.5kDa 
NM_1 77542 small nuclear ribonucleoprotein D2 
NCBI polypeptide 16.5kDa 

Public Domain and Genome References 
small nuclear ribonucleoprotein D2 polypeptide 16.5kDa 
SNRPD2 HGNC 

19q13.2 

Hs.424327 NCBI (FULL LENGTH) 
ENSG00000 125743 Ensembl 
6633 NCBI 
P43330 EMBL-EBI 
601061 NCBI 



Position 



chrl 9:50882558- 
50887282 (-) UCSC 

chrl 9:50882558- 
50887282 (-) UCSC 
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RefSeq Protein NPJD04588 NCBI 
ID 



RefSeq 



NP_808210NCBi 
RefSeq Transcript ID RefSeq Title 

small nuclear ribonucleoprotein polypeptide D2 
small nuclear ribonucleoprotein polypeptide D2 

Functional Annotations 



NM_004597 NCBI 
NM 177542 NCBI 



Ortholog 



ID 

ATH1 -121501:266482 AT 
C. ELEGANS:1 72931 X AT 

DROSGENOME1:1 53483 AT 
MG-U74AV2:95049 AT 
MOE430A:1 452680 AT 



Title Organism Type 

small nuclear ribonucleo Arabidopsis Putative 



protein D2 -related 

small nuclear 
ribonucleoprotein D2 
like 



Celegans 



Ortholog 
Putative 
Ortholog 



Drosophila Putative 
Ortholog 



small nuclear 
ribonucleoprotein D2 

small nuclear 
ribonucleoprotein D2 

ML)11KSUBA:AA271024 S AT small nuclear 

ribonucleoprotein D2 

MOUSE430 2:1452680 AT small nuclear 

ribonucleoprotein D2 

MOUSE430A 2:1452680 AT small nuclear 

ribonucleoprotein D2 

GO Biological Process (view graph) 

ID Description 
245 spliceosome assembly 



Mouse 
Mouse 
Mouse 
Mouse 
Mouse 



6371 mRNA splicing 

GO Cellular Component (view graph) 
ID Description 
Gene Ontology 5681 spliceosome complex 



Evidence 

traceable author 
statement 

traceable author 
statement 

Evidence 

traceable author 
statement 



5732 small nucleolar ribonucleoprotein inferred from electronic 
complex annotation 

30532 small nuclear ribonucleoprotein traceable author 
complex statement 

GO Molecular Function (view graph) 

ID Description . Evidence 

8248 pre-mRNA splicing factor activity inferred from electronic 

annotation 



Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 

Curated 
Ortholog 



Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

Links 

QuickGO 
AmiGO 



Protein 
Similarities 



Method ID Description 

blast 4759158 small nuclear ribonucleoprotein polypeptide D2; 

snRNP core protein D2 [Homo sapiens] 

26337731 

4759158 small nuclear ribonucleoprotein polypeptide D2; 
snRNP core protein D2 [Homo sapiens] 

26337731 



blast 
blast 

blast 
Database 



scop 



ID Description 
d1b34b d1 b34b_ SCOP:b.38. 1 .1 :| D2 core SNRNP 



E-Value 
1.0E-62 

3.0E-62 
1.0E-62 

3.0E-62 

E-Value 
1.85E- 
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Protein 
Domains 



scop 



protein 28 

d1b34b d1b34b_SCOP:b.38.1.1:| D2 core SNRNP 1.85E- 

protein 28 

pfam LSM LSM domain 1.1E-16 

pfam LSM LSM domain 1.1E-16 

InterPro IPR001163 Small nuclear ribonucleoprotein (Sm protein) 
EMBL-EBl 



Target 
Sequence 



Sequence 

>HUGENEFL : Ul 5 0 0 8_AT 

accatcatgagcctcctcaacaagcccaagagtgagatgaccccagaggagctgcagaag 
cgagaggaggaggaatttaacaccggtccactctctgtgctcacacagtcagtcaagaac 
aatacccaagtgctcatcaactgccgcaacaataagaaactcctgggccgcgtgaaggcc 
. ttcgataggcactgcaacatggtgctggagaacgtgaaggagatgtggactgaggtaccc 
aagagtggcaagggcaagaagaagtccaagccagtcaacaaagaccgctacatctccaag 
atgttcctgcgcggggactcagtcatcgtggtcctgcggaacccgctcatcgccggcaag 
taggggccgcctgtctgttgacagaactcactcctctgtcctatgaagaccgctgccatt 
ggtgttgagaata 



Probe Info 



Probe Sequence(5*-3') 


Probe Probe 
X Y 


Probe 
Interrogation 

Position 


Strandedness 


ACCATCATGAGCCTCCTCAACAAGC 


99 


211 


37 


Antisense 


AGTGAGATGACCCCAGAGGAGCTGC 


100 


211 


67 


Antisense 


AACACCGGTCCACTCTCTGTGCTCA 


101 


211 


115 


Antisense 


GGTCCACTCTCTGTGCTCACACAGT 


102 


211 


121 


Antisense 


CTCTCTGTGCTCACACAGTCAGTCA 


103 


211 


127 


Antisense 


GTGCTCACACAGTCAGTCAAGAACA 


104 


211 


133 


Antisense 


TCAGTCAAGAACAATACCCAAGTGC 


105 


211 


145 


Antisense 


AATACCCAAGTGCTCATCAACTGCC 


106 


211 


157 


Antisense 


CAAGTGCTCATCAACTGCCGCAACA 


107 


211 


163 


Antisense 


CGCGTGAAGGCCTTCGATAGGCACT 


108 


211 


205 


Antisense 


AAGGCCTTCGATAGGCACTGCAACA 


109 


211 


211 


Antisense 


TTCGATAGGCACTGCAACATGGTGC 


110 


21.1 


217 


Antisense 


GTACCCAAGAGTGGCAAGGGCAAGA 


111 


211 


271 


Antisense 


TACATCTCCAAGATGTTCCTGCGCG 


112 


211 


325 


Antisense 


TCAGTCATCGTGGTCCTGCGGAACC 


113 


211 


355 


Antisense 


TAGGGGCCGCCTGTCTGTTGACAGA 


114 


211 


397 


Antisense 


TGACAGAACTCACTCCTCTGTCCTA 


115 


211 


415 


Antisense 


CTCCTCTGTCCTATGAAGACCGCTG 


116 


211 


427 


Antisense 


TGTCCTATGAAGACCGCTGCCATTG 


117 


211 


433 


Antisense 


ACCGCTGCCATTGGTGTTGAGAATA 


118 


211 


445 


Antisense 
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Full Record 



-> START ^ 
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Full Screen 



NetAffx Links 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
Name 



Transcript ID 
Sequence 
Type 
Representative 
Public ID 

Target 
Description 



Target 
Sequence 



Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

AFFX-BioB-M_st 

HumanGeneFL Array 



Human 



Probe Design Information 



AFFX-BioB-M 

Control sequence \ 
J04423 NCBI 

J04423 E coli bioB gene biotin synthetase (-5, -M, -3 represent transcript regions 
5 prime, Middle, and 3 prime respectively) 

Sequence 

>HUGENEFL:AFFX-BIOB-M_ST 

gccggagttttacggcaatatcatcaccacacgcacttatcaggaacgcctcgatacgct 
ggaaaaagtgcgcgatgccgggatcaaagtctgttctggcggcattgtgggcttaggcga 
aacggtaaaagatcgcgccggattattgctgcaactggcaaacctgccgacgccgccgga 
aagcgtgccaatcaacatgctggtgaaggtgaaaggcacgccgcttgccgataacgatga 
tgtcgatgcctttgattt 



Probe Sequence^-a 1 ) 



Probe Probe 



Probe 

Interrogation strandedness 
Position 



Probe.lnfo 



GATGATATTGCCGTAAAACTCCGGC 201 

TGTGGTGATGATATTGCCGTAAAAC 202 

TAAGTGCGTGTGGTGATGATATTGC 203 

GTTCCTGATAAGTGCGTGTGGTGAT 204 

ATCGAGGCGTTCCTGATAAGTGCGT 205 

GCATCGCGCACTTTTTCCAGCGTAT 206 

GATCCCGGCATCGCGCACTTTTTCC 207 

GACTTTGATCCCGGCATCGCGCACT 208 

CGCCAGAACAGACTTTGATCCCGGC 209 

CCCACAATGCCGCCAGAACAGACTT 210 



483 
489 
497 
505 
513 
536 
543 
549 
559 
569 



Sense 
Sense 
Sense 
Sense 
Sense 
Sense 
, Sense 
Sense 
Sense 
Sense 
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TGCAGCAATAATCCGGCGCGATCTT 


211 


11 


611 


Sense 


TTGCCAGTTGCAGCAATAATCCGGC 


212 


11 


619 


Sense 


CGGCAGGTTTGCCAGTTGCAGCAAT 


213 


11 


627 


Sense 


ATGTTGATTGGCACGCTTTCCGGCG 


214 


11 


656 


Sense 


CACCAGCATGTTGATTGGCACGCTT 


215 


11 


663 


Sense 


TTCACCTTCACCAGCATGTTGATTG 


216 


11 


671 


Sense 


AGCGGCGTGCCTTTCACCTTCACCA 


217 


11 


- 683 


Sense 


CATCATCGTTATCGGCAAGCGGCGT 


218 


11 


700 


Sense 


GCATCGACATCATCGTTATCGGCAA 


219 


11 


707 


Sense 


AAATCAAAGGCATCGACATCATCGT 


220 


11 


716 


Sense 
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Details for HUGENEFL:AFFX-BIODN-5_ST 

Full Slcreen 

Cluster Members 



NetAffx Links 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
Name 



Transcript ID 

Sequence 
Type 



Consensus/Exemplar 

GeneChip Array Information 

AFFX-BioDn-5_st , 
HumanGeneFL Array; 

Human 



Probe Design Information 



AFFX-BioDn-5 
Control sequence 



ReP pubnUD tiVe ^4423 NCBl 



Target 
Description 



Target 
Sequence 



J04423 E cpli bioD gene dethiobiotin synthetase (-5 and -3 represent transcript 
regions 5 prime and 3 prime respectively). 

Sequence 

>HUGENEFL:AFFX-BIODN-5_ST. 

gggaaaactgtcgccagttgtgcacttttacaagccgcaaaggcagcaggctaccggacg 
gcaggttataaaccggtcgcctctggcagcg'aaaagaccccggaaggtttacgcaatagc 
gacgcgctggcgttacagcgcaacagcagcctgcagctggattacgcaacagtaaatcct 
tacaccttcgcagaacccacttcgccgcacatcatcagcgcgcaagagggcagaccgata 
gaatcattggtaatgagcgccggattacgcgcgcttg 



Probe Info 



Probe Sequence(5 , -3') 


Probe Probe 
X Y 


Probe 
Interrogation 


Strandedness 




Position 




GTGCACAACTGGCGACAGTTTTCCC 


281 11 


49 


Sense 


GGCTTGTAAAAGTGCACAACTGGCG 


282 11 . 


60 


Sense 


GCTGCCTTTGCGGCTTGTAAAAGTG 


283 11 


71 


Sense 


GGTAGCCTGCTGCCTTTGCGGCTTG 


284 11 


79 


Sense 


CCGTCCGGTAGCCTGCTGCCTTTGC 


285 ^;11 


85 


Sense 


CAGCGCGTCGCTATTGCGTAAACCT 


286 . 11 


153 


Sense 


GTAACGCCAGCGCGTCGCTATTGCG 


2&7 11 


160 


Sense 


TTGCGCTGTAACGCCAGCGCGTCGC 


288 11 


167 


Sense 


TGCTGTTGCGCTGTAACGCCAGCGC 


289 11 


172 


Sense 


TGCAGGCTGCTGTTGCGCTGTAACG 


290 11 


179 


Sense 
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TCCAGCTGCAGGCTGCTGTTGCGCT 


291 


11 


185 


Sense 


TGCGTAATCCAGCTGCAGGCTGCTG 


292 


11 


192 


Sense 


TTACTGTTGCGTAATCCAGCTGCAG 


293 


11 


199 


Sense 


CGGTCTGCCCTCTTGCGCGCTGATG 


294 


11 


261 


Sense 


GATTCTATCGGTCTGCCCTCTTGCG 


295 


11 


269 


Sense 


TACCAATGATTCTATCGGTCTGCCC 


296 


11 


276 


Sense 


CTCATTACC AATG ATTCTATCG GTC 


297 


11 


281 


Sense 


TCCGGCGCTCATTACCAATGATTCT 


298 


11 


288 


Sense 


CGCGTAATCCGGCGCTCATTACCAA 


299 


11 


295 


Sense 


CAAGCGCGCGTAATCCGGCGCTCAT 


300 


11 


301 


Sense 
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-> Genotyping 

-> BLAST Status 

-> New Folder 
-> Expression 
Queries 

->{1)A!I Descriptions 

(X15880_at) 
•>(1)AII Descriptions 

(HG4011- 

HT4804_s_at) 
->(1)Ail Descriptions 

(AFFX-BioDn- 

5 st) 

->(f)Ail Desaiptions 
(AFFX-BioB-M_st) 

•>(1)AII Descriptions 
(HG613) 
-> Genotyping 

Queries 



The new GeneChip* One-Cycle and 
Two-Cycte cONA Synthesis Kits. 
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Full Record 



Details for HUGENEFL:X1 5880_AT 
Full Screen 

Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

X15880 at 



-> START t 



NetAffx Links 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
Name 



HumanGeneFL Array 



Human 



Probe Design Information 



Transcript ID X15880 
^Type^ Exemplar sequence 

ReP publi^^ 



Target 
Description 



Assembly 
Alignment(s) 



Overlapping 
Transcripts 



X15880, class C, 20 probes, 20 in all_X15880 1690-2273, Human mRNA for 
collagen VI alpha- 1 C-terminal globular domain 

Genomic Alignment of Target Sequence 
April 2003 (NCBI 33) 

Position % Identity Cytoband 

chr21: 46280561-46281145 (+) UCSC 100 q22.3 



Representative 
Transcript 

NM_001848 
NCBI 



UniGene Description Position 

collagen, type VI, alpha chr21:46257869-46281164 (+) 
1 UCSC 



Public Domain and Genome References 

Gene Title collagen, type VI, alpha 1 

Gene Symbol COL6A1 HGNC 

Chromosomal 21q22 3 
Location q 

Hs.41 5997 NCBI (FULL LENGTH) 

ENSG00000142156 Ensembl 

1291 NCBI 

P121Q9 EMBL-EBI 
Q7Z645 EMBL-EBI 
Q8TBN2 EMBL-EBI 
Q9BSA8 EMBL-EBI 

OMIM 120220 NCBI 



UniGene ID 
Ensembl 
LocusLink 

SwissProt 
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RefSeq Protein NPJ) 01839 NCBi 



ID 
RefSeq 



RefSeq Transcript ID RefSeq Title 

NMJ301848 NCBI collagen, type VI, alpha 1 precursor 

Functional Annotations 



Ortholog 



ID 


Title 


Organism 


Type 


MG-U74AV2:162459 F AT 


procollagen, type VI, 


Mouse 


Curated 




alpha 1 




Ortholog 


MG-U74AV2:95493 AT 


procollagen, type VI, 


Mouse 


Curated 




alpha 1 




Ortholog 


MOE430A: 1448590 AT 


procollagen, type VI, 


Mouse 


Curated 


alpha 1 




Ortholog 


MU11KSUBB:X66405 S AT 


procollagen, type VI, 


Mouse 


Curated 




alpha 1 




Ortholog 


MOUSE430 2:1448590 AT 


procollagen, type VI,. 


Mouse 


Curated 




alpha 1 




Ortholog 


MOUSE430A 2:1448590 AT 


procollagen, type VI, 


Mouse 


Curated 




alpha 1 




Ortholog 



Gene Ontology 



GO Biological Process (view graph) 

ID Description 
7155 cell adhesion 

GO Cellular Component (view graph) 

ID Description 
5578 extracellular matrix 

5589 collagen type VI 



Evidence 

non-traceable author 
statement 

Evidence 

inferred from electronic 
annotation 

non-traceable author 
statement 



GO Molecular Function {view graph) 

ID Description 

5194 cell adhesion molecule activity inferred from electronic 

annotation 

5201 extracellular matrix structural 
constituent 



Evidence 



inferred from electronic 
annotation 



Links 

QuickGO 
AmiGO 

Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 



Protein 
Similarities 



Protein 
Domains 



Method 


ID 


Description 


E-Value 


blast 


15011913 




0.0 


blast 


13878903 




0.0 


Database 


ID 


Description 


E-Value 


scop 


dlatza 


d1atza_SCOP:c.62.1.1:| von Willebrand factor 


3.63E- 




A3 domain 


37 


pfam 


ywa 


von Willebrand factor type A domain 


9.6E-24 


pfam 


vwa 


von Willebrand factor type A domain 


4.7E-32 


pfam 


vwa 


von Willebrand factor type A domain 


2.7E-35 


pfam 


Collagen 


Collagen triple helix repeat (20 copies) 


2.4E-11 


pfam 


Collaqen 


Collagen triple helix repeat (20 copies) 


3.8E-14 


pfam 


Collaaen 


Collagen triple helix repeat (20 copies) 


3.3E-10 


pfam 


Collaaen 


Collagen triple helix repeat (20 copies) 


2.6E-11 


InterPro 


IPR008161 


Collagen helix repeat ' 






EMBL-EBI 






InterPro 


IPR002035 


von Willebrand factor, type A 






EMBL-EBI 
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InterPro IPR008160 Collagen triple helix repeat 
EMBL-EBI 



Target 
Sequence 



Sequence 

>HUGENEFL : XI 5 8 8 0_AT 

agcaagacgcctctcggggcctgtgccgcactagcctccctctcctctgtccccatagct 
ggtttttcccaccaatcctcacctaacagttactttacaattaaactcaaagcaagctct 
tctcctcagcttggggcagccattggcctctgtctcgttttgggaaaccaaggtcaggag 
gccgttgcagacataaatctcggcgactcggccccgtctcctgagggtcctgctggtgac 
cggcctggaccttggccctacagccctggaggccgctgctgaccagcactgaccccgacc 
tcagagagtactcgcaggggcgctggctgcactcaagaccctcgagattaacggtgctaa 
ccccgtctgctcctccctcccgcagagactggggcctggactggacatgagagccccttg 
gtgccacagagggctgtgtcttactagaaacaacgcaaacctctccttcctcagaatagt 
gatgtgttcgacgttttatcaaaggccccctttctatgttcatgttagttttgctccttc 
tgtgtttttttctgaaccatatccatgttgctgacttttccaa 



Probe Info 



Probe Sequence(5'-3') 

AGCAAGACGCCTCTCGGGGCCTGTG 
AAACTCAAAGCAAGCTCTTCTCCTC 
AAAGCAAGCTCTTCTCCTCAGCTTG 
TCTCCTCAGCTTGGGGCAGCCATTG 
GCCATTGGCCTCTGTCTCGTTTTGG 
GCAGACATAAATCTCGGCGACTCGG 
GCCCCGTCTGCTGAGGGTCCTGCTG 
TGGCCCTACAGCCCTGGAGGCCGCT 
TCAGAGAGTACTCGCAGGGGCGCTG 
AGTACTCGCAGGGGCGCTGGCTGCA 
GGCGCTGGCTGCACTCAAGACCCTC 
GGACATGAGAGCCCCTTGGTGCCAC 
GAGAGCCCCTTGGTGCCACAGAGGG 
CCCTTGGTGCCACAGAGGGCTGTGT 
GTGCCACAGAGGGCTGTGTCTTACT 
CAGAGGGCTGTGTCTTACTAGAAAC 
CTCCTTCCTCAGAATAGTGATGTGT 
TTTTTCTGAACCATATCCATGTTGC 
TGAACCATATCCATGTTGCTGACTT 
ATATCCATGTTGGTGACTTTTCCAA 



Probe Probe 
X Y 


Probe 
Interrogation 

Pncitinn 


Stranded nesi 


76 


317 






77 


317 






78 


317 


1810 


Antisense 


79 


317 


1822 


Antisense 


80 


317 


1840 


Antisense 


81 


317 


1888 


Antisense 


82 


317 


1912 


. Antisense 


83 


317 


1954 


Antisense 


84 


317 


2002 


Antisense 


85 


317 


2008 


Antisense 


86 


317 


2020 


Antisense 


87 


317 


2104 


Antisense 


88 


317 


2110 


Antisense 


89 


317 


2116 


Antisense 


90 


317 


2122 


Antisense 


91 


317 


2128 


Antisense 


92 


317 


2164 


Antisense 


93 


317 


2248 


Antisense 


94 


317 


2254 


Antisense 


95 


317 


2260 


Antisense 
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search site 



AFFYMETRIX^ 



PRODUCTS 



:: GETTING 
STARTED 

-> Wizard 

» QUERY 

Expression 

-> Quick Query 
-> Standard Query 
-> Batch Query 
-> BLAST 
-> Probe Match 
->UCSC Query . 
Genotyping 
-> Quick Query 
-> Standard Query 
-> Batch Query 
-> UCSC Query 
-> SNP Finder 

CURRENT QUERY 
1 probe sets 

. -> Annotations 
-> Show Orthologs 
-> GO Browser 
-> Export 

:: QUERY HISTORY 

Annotation Views 

-> Expression 
-> Genotyping 

-> BLAST Status 

-> New Folder 
-> Expression 
Queries 

->(1)Ail Descriptions 

(U23752_at) 
->(1)AII Descriptions 

(HG1800- 

HT1823_at) 
->(1)AII Descriptions 

(U15008_at) . 
->{1)AII Descriptions 

(HG3523) 
->all probe sets 

(7129) 
-> Genotyping 
Queries 



% / ,2004 Affymetrix ®MM%®m 
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-> START ) 



Full Record 



Details for HUGENEFL:U23752_AT 
Full Screen 

, . , Cluster Members 
NetAffx Links CQnsensus/Ex em P | ar . 

GeneChip Array Information 

Probe Set ID U23752_at 

GeneChip HumanGeneF L Array 
Array 



Organism 
Common 
Name 



Human 



Probe Design Information 



Transcript ID U23752 
^Type^ Exemplar sequence 

R T e Kr n m [Ve U23752NCBI 
Public ID 

Target U23752, class A, 20 probes, 20 in U23752 1679-1919, Human SOX-11 mRNA, 
Description complete cds 

Genomic Alignment of Target Sequence 
Assembly April 2003 (NCBI 33) 

Position % Identity Cytoband 

Alignment(s) chr2: 58 56 192-5856457 (+) UCSC 99 p25.2 



Overlapping 
Transcripts 



Gene Title 



Representative UniGene Description Position 

Transcript 

NM_003108 SRY (sex determining region Y)- chr2:5854537-5863255 (+) 

NCBI box 11 UCSC 



Public Domain and Genome References 
SRY (sex determining region Y)-box 1 1 



Gene Symbol SOX1 1 HGNC 
Chromosomal 
Location 



2p25 

. UniGene ID Hs.432638 NCBI (FULL LENGTH) 
Ensembl ENSG000001 76887 Ensemb| 
LocusLink 6664 NCBI 
SwissProt P35716 EMBL-EBI 
OMIM 600898 NCBI 
RefSeq^Protein N p_ 0Q3Q99 NCB{ 
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RefSeq 



Ortholog 



RefSeq Transcript ID RefSeq Title 
NMJ)03108 NCB1 SRY-box 1 1 

Functional Annotations 

ID Title Organism 

RAE230A; 1387275 AT SRY-box containing gene Rat 
11 

RG- SRY-box containing gene Rat 

U34A:AJ004858 AT 11 

GO Biological Process (view graph) 

ID Description Evidence 

6355 regulation of transcription, DNA- inferred from electronic 

dependent 
7399 neurogenesis 



Gene Ontology 



Protein 
Similarities 



Protein 
Domains 



annotation 

traceable author 
statement 



GO Cellular Component (view graph) 

ID Description 
5634 nucleus 

GO Molecular Function (view graph) 

ID Description 
3677 DNA binding 



Evidence 

inferred from electronic 
annotation 



Evidence 

inferred from electronic 
annotation 



Type 

Putative 
Ortholog 

Putative 
Ortholog 

Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

Links 

QuickGO 
AmiGO 

Links 

QuickGO 
AmiGO 



Method ID Description E-Value 

blast 4507161 SRY-box 1 1 ; SRY (sex-determining 0.0 

region Y)-box 11; SRY-related 
HMG-box gene 1 1 ; transcription 
factor SOX-11 [Homo sapiens] 



blast 

Database 

scop 

pfam 
inter Pro 



23831472 

ID Description 
d1i11a d1i11a_SCOP:a.21.1.1:| Sox-5 

HMG box HMG (high mobility group) box 
IPR000910 HMG1/2 (high mobility group) box 
EMBL-EBI 



0.0 



E-Value 

2.36E- 

1.1E-33 



Target 
Sequence 



Sequence 

>HUGENEFL : U2 3 7 5 2_AT 

cttcctttatcgtgtctcaaggtagttgcatacctagtctggagttgtgattattttccc 
aaaaaatgtgtttttgtaattactatttctttttcctgaaattcgtgattgcaacaaagg 
cagagggggcggcgcggcggaggggaggtaggacccgctccggaaggcgctgtttgaagc 
ttgtcggtctttgaagtctggaagacgtctgcagaggacccttttggcagcacaactgtt 
actctagggagttggtggagatatt 



Probe Sequence(5'-3') 


Probe 
X 


Probe 
Y 


Probe 
Interrogation 

Position 


Strandedness 


CTTCCTTTATCGTGTCTCAAGGTAG 


503 


219 


1691 


Antisense 


TTATCGTGTCTCAAGGTAGTTGCAT 


504 


219 


1697 


Antisense 


TCGTGTCTCAAGGTAGTTGCATACC 


505 


219 


1700 


Antisense 


AAGGTAGTTGCATACCTAGTCTGGA 


506 


219 


1709 


Antisense 


GTAGTTGCATACCTAGT.CTGGAGTT 


507 


219 


1712 


Antisense 
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GTTGCATACCTAGTCTGGAGTTGTG . 


oUo 




A7i ^ 


Antisense 


TACCTAGTCTGGAGTTGTGATTATT 


CAQ 


zi y 


1 791 


Antlconco 
nnifoci loc 


CTAGTCTGGAGTTGTGATTA 1 I 1 IC 


01 U 


^iy . 




Antisense 


TGTGATTAl 1 I I CCCAAAAAATGTG 


01 1 


^iy 


1 ( OO 


Antisense 


MM CCTGAAATTCGTGATTGCAAC 


512 


219 


1 7o1 


Antisense 


GCTCCGGAAGGCGCTGTTTGAAGCT 


513 


2i y 




Antisense 


GGTGTTTGAAGCTTGTCGGTCTTTG 


514 


219 


1859 


Antisense 


Probe Info . TGAAGCTTGTCGGTCTTTGAAGTCT 


515 


219 


. 1865 


Antisense 


TTGTCGGTCTTTGAAGTCTGGAAGA 


516 


219 


-1 0"7«< 

18/1 


Antisense 


TGGAAGACGTCTGCAGAGGACCCTT 


517 


219 


looy 


Antisense 


AAGACGTCTGCAGAGGACCC 1 1 1 1 G 


olo 


O-JQ 

iiiy 


A flG9 


A nticDnco 


GCAGAGGACCC 1 1 1 I GGCAGCACAA 


519 


219 


1901 


Antisense 


AGCACAACTGTTACTCTAGGGAGTT 


520 


219 


1919 


Antisense 


ACTGTTACTCTAGGGAGTTGGTGGA 


521 


219 


1925 


Antisense 


• ACTCTAGGGAGTTGGTGGAGATATT 


522 


219 


1931 


Antisense 
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AFFYMETRIX 

ax:--: 



2004 




PRODUCTS 



:: GETTING 
STARTED 

-> Wizard 

:: QUERY 

Expression 
-> Quick Query 
-> Standard Query 
-> Batch Query 
-> BLAST 
-> Probe Match 
-> UCSC Query 

Genotyping 
-> Quick Query 
-> Standard Query 
-> Batch Query 
-> UCSC Query 
-> SNP Finder 

:: CURRENT QUERY 
1 probe sets 

-> Annotations 
-> Show Orthologs 
-> GO Browser 
-> Export. 

QUERY HISTORY 

Annotation Views 

-> Expression 
->'Genotyping 

-> BLAST Status 

'-> New Folder 
^Expression 
Queries 

->(1 ^Descriptions 

(M12625_at) 
->{1)A!I Descriptions 

(U23752_at) 
->(1)A1I Descriptions 

(HG1800- 

HT1823 at) 
■>(1)Ali Descriptions 

(U15008_at) 
->(1)A!I Descriptions 

(HG3523) 
-> Genotyping 
Queries 
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Full Record 



Details for HUGENEFL:M12625_AT 
Full Screen 

Cluster Members 
Consensus/Exemplar 

GeneChip Array Information 

M 12625 at 



-> START } 



NetAffx Links 



Probe Set ID 
GeneChip 
Array 

Organism 
Common 
Name 



Transcript ID 
Sequence 
Type 



HumanGeneFL Array 



Human 



Probe Design Information 



M12625 

Exemplar sequence 



Representative ^ NCB| 
Public ID 



Target 
Description 



Assembly 
Alignment(s) 



Overlapping 
Transcripts 



Gene Title 

Gene Symbol 

Chromosomal 
Location 

UniGene ID 

Ensembl 

LocusLink 

SwissProt 

EC 
OMIM 



M12625, class B, 20 probes, 13 in M12625mRNA 893-1259: 7 in 
reverseSequence, 1599-1683, Human lecithin-cholesterol acyltransferase mRNA, 
complete cds, with 5* and 3' flanking DNA sequences 

Genomic Alignment of Target Sequence 
April 2003 (NCBl 33) 

Position % Identity Cytoband 

Chr16: 67749925-67750484 .(-) UCSC 100 q22.1 



M 12625 NCBl 



Representative Un iGene Description 
Transcript 

lecithin-cholesterol 
acyltransferase 

Public Domain and Genome References 
lecithin-cholesterol acyltransferase 
LCAT HGNC 

16q22.1 

Hs.387239 NCBl (FULL LENGTH) 
ENSG000001 03080 Ensernbj : : . ; 

3931 NCBl - ;. y. 

AAP88750 EMBL-EBl 
PQ4180 EMBL-EBl 

2.3.1.43 
606967 NCBl 



Position 

chrl 6:67749888-67754507 (-) 
UCSC . 
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RefSeq Protein NPJ)0 o220 NCBI 



RefSeq 



Ortholog 



Gene Ontology 



Protein 
Similarities 



Protein 
Families 



RefSeq Transcript ID RefSeq Title 

NM_000229 NCBI lecithin-cholesterol acyltransferase precursor 

Functional Annotations 

ID Title Organism Type 

lecithin cholesterol Mouse Curated 

acyltransferase Ortholog 

lecithin cholesterol Mouse Curated 

acyltransferase Ortholog 

lecithin cholesterol Mouse Curated 

acyltransferase Ortholog 

MU11KSUBA:JQ5154 S AT lecithin cholesterol Mouse Curated 

acyltransferase Ortholog 

lecithin cholesterol Rat Curated 

acyltransferase ■ Ortholog 

lecithin cholesterol ■ Rat Curated 

acyltransferase Ortholog 

lecithin cholesterol Mouse Curated 

acyltransferase Ortholog 

MOUSE43QA 2:1417043 AT lecithin cholesterol Mouse Curated 

acyltransferase Ortholog 

GO Biological Process (view graph) 

ID Description 
6629 lipid metabolism 



MG-U74AV2:103023 AT 
MG-U74AV2:161759 R AT 
MOE43QA:1417043 AT 



RAE230A: 1367887 AT 
RG-U34A:X54096 AT 
MOUSE43Q 2:1417043 AT 



Evidence 

inferred from 
electronic annotation 



Links 

QuickGO 
AmiGO 



GO Cellular Component (view graph) 

ID Description 
5576 extracellular 

GO Molecular Function (view graph) 
ID Description 

4607 phosphatidylcholine-sterol O- 

acyltransferase activity 
841 5 acyltransferase activity 

16740 transferase activity 



Evidence 
not recorded 



Evidence 

inferred from 
electronic annotation 

inferred from 
electronic annotation 

inferred from 
electronic annotation 



Links 

QuickGO 
AmiGO 



Links 

QuickGO 
AmiGO 

QuickGO 
AmiGO 

QuickGO 
AmiGO 



Method 

blast 
blast 

Method 



ID 

32879837 
4557892 

ID 



Description 

lecithin-cholesterol acyltransferase precursor 
[Homo sapiens] 

Description 



E-Value 

0.0 

0.0 



ec LCAT HUMAN LCATJHUMAN 

EC:2.3.1.43:PHOSPHATIDYLCHOLINE- 
STEROL ACYLTRANSFERASE PRECURSOR 
(EC 2.3.1.43) (LECITHIN-CHOLESTEROL 
ACYLTRANSFERASE) (PHOSPHOLIPID- 
CHOLESTEROL ACYLTRANSFERASE). 



E- 
Value 

1.85E- 
17t> 



Database 



ID 



Description 



E-Value 
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Protein 
Domains 



scop 
pfam 



dltca d1tca_SCOP:c.69.1.17:| Triacy [glycerol lipase 5.3E-8 

LACT Lecithinxholesterolacyltransferase 1.7E- 
7 182 

InterPro IPR003386 Lecithinxholesterol acyltransferase 
EMBL-EBI 

InterPro IPR008262 Lipase, active site 
EMBL-EBI 



Target 
Sequence 



Trans Membrane 
ID 

NP 000220 



Number Of 
Domains 



Probability of Interior N-Terminus 
0.05945 



Sequence 

. >HUGENEFL : Ml 2 6 2 5_AT 

cttcaactacacaggccgtgacttccaacgcttctttgcagacctgcactttgaggaagg 
ctggtacatgtggctgcagtcacgtgacctcctggcaggactcccagcacctggtgtgga- 
agtatactgtctttacggcgtgggcctgcccacgccccgcacctacatctacgaccacgg 
cttcccctacacggaccctgtgggtgtgctctatgaggatggtgatgacacggtggcgac 
ccgcagcaccgagctctgtggcctgtggcagggccgccagccacagcctgtgcacctgct 
gcccctgcacgggatacagcatctcaacatggtcttcagcaacctgaccctggagcacat 
caatgccatcctgctgggtgcctaccgccagggtccccctgcatccccgactgccagccc 
agagcccccgcctcctgaataaagaccttcctttgctaccgtaagccctgatggctatgt 
ttcaggttgaagggaggcactagagtcccacactaggtttcactcctcaccagccacagg 
ctcagtgctgtgtgcagtg 



Probe Info 



Probe Sequence^' -3') 


Probe 
X 


Probe 
Y 


Probe 
Interrogation 


Strandedness 






Position 




CTTCAACTACACAGGCCGTGACTTC 


152 


127 


1161 


Antisense 


CTACACAGGCCGTGACTTCCAACGC 


153 


127 


1167 


Antisense 


CCAACGCTTCTTTGCAGACCTGCAC 


154 


127 


1185 


Antisense 


CCTGCACTTTGAGGAAGGCTGGTAC 


155 


127 


1203 


Antisense 


CATGTGGCTGCAGTCACGTGACCTC 


156 


127 


1227 


Antisense 


GCTGCAGTCACGTGACCTCCTGGCA 


157 


127 


1233 


Antisense 


CCTGGCAGGACTCCCAGCACCTGGT 


158 


127 


1251 


Antisense 


GGACCCTGTGGGTGTGCTCTATGAG 


159 


127 


1353 


Antisense 


TGTGCTCTATGAGGATGGTGATGAC 


160 


127 


1365 


Antisense 


GGCGACCCGCAGCACCGAGCTCTGT 


161 


127 


1395 


Antisense 


GCTGACCCTGGAGCACATCAATGCC 


162 


127 


1503 


Antisense 


GCACATCAATGCCATCCTGCTGGGT 


163 


127 


1515 


Antisense 


CATCCTGCTGGGTGCCTACCGCCAG 


164 


127 


1527 


Antisense 


CTTTGCTACCGTAAGCCCTGATGGC 


165 


127 


1611 


Antisense 


TACCGTAAGCCCTGATGGCTATGTT 


166 


127 


1617 


Antisense 


AAGCCCTGATGGCTATGTTTCAGGT 


167 


127 


1623 


Antisense 


CTATGTTTCAGGTTGAAGGGAGGCA 


168 


127 


1635 


Antisense 


GGAGGCACTAGAGTCCCACACTAGG 


169 


127 


1653 


Antisense 


GTCCCACAGTAGGTTTCACTCCTCA 


170 


127 


1665 


Antisense 


GACAGGCfCAGTGCTGTGTGCAGTG 


171 


,127 


1695 


Antisense 
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